DIMENSIONS OF TIGHT SPANS 



MIKE DEVELIN 

Abstract. Given a finite metric, one can construct its tight span, a geometric object representing 
the metric. The dimension of a tight span encodes, among other things, the size of the space of 
explanatory trees for that metric; for instance, if the metric is a tree metric, the dimension of the 
tight span is one. We show that the dimension of the tight span of a generic metric is between [^] 
and |_§J , and that both bounds are tight. 



1. Introduction 

Let d be a metric on a set of n points labeled with the elements of [n] := {1, • • ■ ,n}, i.e. a 
function 

d : (^2^ ~^ ^ + such that dij + djk > dik for all i,j, k 6 [n]. 

The injective hull |3] or tight span of the metric is a geometric object encoding it, generalizing 
the corresponding tree in the case of a tree metric pQ; Dress, Huber, and Moulton j^j showed that 
it is given by the complex of bounded faces of the polyhedron 

P d = {x € (M + ) n | Xi + Xj > dij, 1 < i < j < n}. 

In [3] , Sturmfels and Yu observed that this is polar to the complex of interior faces of the regular 
subdivision of the hypersimplex 

A(n, 2) := conv {a + e,- | 1 < i < j < n} C R n 

given by lifting a vertex + e.j to height dij and taking the complex of upper faces of the resulting 
polytope. 

Another formulation of the tight span mentioned in [I] is the following. A metric corresponds 
to assigning a weight dy to each edge ij of the complete graph K n ; a subgraph G corresponds to 
a cell of Ad if there exists an x € M n satisfying 

Xi + Xj = dij if ij € G 
Xi + Xj > dij if ij G. 

The vector (x, 1) gives the coordinates of the hyperplane defining the corresponding upper cell in 
the lifted hypersimplex. 

As a polyhedral complex, the tight span has a dimension. This dimension measures the com- 
binatorial dimension of the metric; for instance, if the metric is a tree metric (so that it can be 
realized by placing the n points on a tree with weighted edges and taking the resulting distances), 
the dimension of the tight span will be one. In a sense, the dimension measures how far the metric 
is from being a tree metric. 

It is easy to express dimension in terms of the hypersimplex formulation, due to the equivalence 
of complexes outlined above. In particular, the dimension of the tight span is equal to the maximal 
codimension of any interior cell of the corresponding triangulation. 
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This dimension for arbitrary metrics can be any number between one and |_§J- It makes sense 
to restrict to metrics satisfying the following genericity condition. 

Definition 1.1 (;1J). A metric is generic if each cell of the corresponding subdivision o/A(n, 2) is 
a simplex. 

For instance, this definition forbids such non-generic behavior as d\2 + ^34 = d\z + c?24 5 which 
corresponds to four points alignable on a tree. The genericity condition consists of being in a full- 
dimensional cell of the metric fan [1] which partitions all metrics into combinatorial equivalence 
classes of tight spans. 

Computation up to n = 4 is easily done by hand; all generic metrics on 2 or 3 points have 
combinatorial dimension one, while all generic metrics on four points have combinatorial dimension 
two. For n = 5, all generic metrics have combinatorial dimension two PP, but for n = 6, Sturmfels 
and Yu showed that, surprisingly, generic metrics do not all have the same dimension; they can 
have dimension two or three. 

In this paper, we complete the classification of combinatorial dimensions of metrics, proving the 
following theorem. 

Theorem 1.2. The combinatorial dimension of any generic metric on n points lies between \^~\ 
and [§J, inclusive. Both bounds are tight. 

Along the way, we present a connection to integrality of a certain linear program, and develop 
a theory of what the faces in a triangulation of a hypersimplex look like. Note that Theorem 11.21 
implies a corresponding result about triangulations of hypersimplices, namely that any such trian- 
gulation has its smallest interior cell of codimension between \^~\ and |_§J- 

2. Linear Programming 

Take a generic finite n-point metric d in its graph representation, i.e. an edge-labeled K n with 
labels satisfying the triangle inequality. Let < u £ M n be a nonnegative vector (in practice, 

usually integral), and denote by |u;| half the L -norm of to, Ep-. Then we make the following 
definitions. 

Definition 2.1. A fractional w-matching c is an assignment of a weight dj > to each edge of K n 
such that u>i = for all i. If to = (1, . . . , 1), we call this a fractional 1-matching. The support 

supp{c) of a fractional co-matching is the set of edges ij with c^ > 0. 

Basically, to represents the desired valences of the vertices, and a fractional w-matching is an as- 
signment of (possibly non-integral) multiplicities to the edges to create a graph with those valences. 
We will be especially concerned with the following fractional w-matchings. 

Definition 2.2. A fractional to-matching c is called LP-optimal if c ■ d = c ijdij is maximal 
among all fractional to-matchings. 

These LP-optimal fractional w-matchings are the key actors in our investigation. For one, they 
correspond to cells in the corresponding subdivision. 

Proposition 2.3. Let c be an LP-optimal fractional to-matching. Then supp(c) is a cell in the 
corresponding subdivision. Furthermore, all cells arise in this fashion. 

Proof. Consider the point x = ^ € A(n, 2). In the lifted hypersimplex, the points (x, h) come from 
taking convex combinations of lifted vertices to get x; these are precisely fractional j^r-matchings. 

Since c is LP-optimal, t^t is also, and thus it maximizes h over all such linear combinations. 
Therefore, its support set is a subset of the face containing x in its interior. Since d is generic, this 
face is a simplex, so supp(c) is itself a cell as desired. 
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For the converse direction, take any cell, and take to a point in its relative interior. Then we can 
write to as a convex combination of all of its vertices, and the corresponding fractional cj-matching 
given by the coefficients of this convex combination will be LP-optimal, since the lifted relevant 
vertices are all in the same face of the upper envelope, meaning that combining them yields a point 



Therefore, we only need to investigate LP-optimal fractional w-matchings in order to determine 
the cells of the corresponding subdivision. We now prove a series of propositions connecting the 
two. Our first two propositions connect these matchings to the problem we are using them to solve. 

Proposition 2.4. Let G be the support of an LP-optimal fractional to-matching. Then the corre- 
sponding cell is interior if and only if G is a spanning subgraph of K n which is not a Ki n —i. 

Proof. If G is not a spanning subgraph, then there exists some vertex i which is not in any of its 
edges. The corresponding cell then lives in the facet-defining hyperplane X{ = of the hypersimplex. 
Similarly, if G is a -fTi.n-i with the special element i, then the corresponding cell lives in the facet- 
defining hyperplane X\ = 1 of the hypersimplex. 

Conversely, if G is a spanning subgraph which is not a Ki n —i, then taking the average of the 
vertices of the corresponding cell, we obtain a point in the cell with all entries different from zero 
and one, which lies in the interior of the hypersimplex. Thus the cell must be an interior cell. 



Proposition 2.5. Let G be the support of an LP-optimal fractional to-matching. The dimension 
of the corresponding cell of the triangulation o/A(n, 2) is one less than the number of edges of G. 

Proof. The number of vertices of the corresponding cell is equal to the number of edges of G by 
definition. Since the metric is generic, all cells are simplices, so the dimension of this cell is that 



We now know what statistic to investigate to solve the problem: we need to determine the 
minimum number of edges in a spanning support of an LP-optimal fractional w-matching. We now 
embark upon the investigation of such graphs. 

Proposition 2.6. Let G be the support of an LP-optimal fractional to-matching. Then G has no 
nontrivial even tours, where a tour is a sequence of edges i^jk with jk = ik+i beginning and ending 
at the same vertex, and a tour is trivial if every edge appears at least twice. 

Proof. Suppose G had a nontrivial even tour. Let the corresponding vertices of the hypersimplex 
be vi, Wi, ■ ■ ■ , v n , uu n . Then v\ + . . . + v n = w\ + . . . + w n is a nontrivial affine dependence among 
the vertices of the corresponding cell, so it this cell is not a simplex, contradicting the fact that d 
is generic. □ 

Therefore, all connected components of such a graph must either be trees or have exactly one 
cycle of odd length. 

Proposition 2.7. Given a vector to, there exists a unique LP-optimal fractional to-matching. 

Proof. Suppose there are two distinct LP-optimal fractional w-matchings c\ and c<i- Then we can 
write the point -fa in two different convex combinations, each of which lifts to an upper-envelope 
convex combination. This yields an affine dependence among the vertices of the face given by the 
LP-optimal fractional cu-matching C1 \ C2 (i.e. supp(ci) U supp(c2)), a contradiction since it would 
then not be a simplex and d is generic. □ 

This means that any to corresponds to a unique LP-optimal fractional cj-matching. The next 
proposition establishes a sort-of converse. 




□ 



□ 



number minus one. 



□ 
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Proposition 2.8. Let G be the support of an LP- optimal fractional (^-matching r. Then all frac- 
tional LO-matchings (for all lo ) with support G are LP-optimal. 

Proof. Suppose not. Then we have some c a fractional w-matching with support G and d a 
fractional ^-matching with d ■ d > c ■ d. Take e small enough so that e ■ c < r. Then ed ■ d > ec- d, 
so (ed — ec + r) • d > r ■ d. Since ed and ec are both w-matchings, ed — ec + r is a 4> matching, 
contradicting LP-optimality of r. □ 

In the hypersimplex language, this is saying that if some convex combination of lifted points is 
in the upper envelope, then every convex combination of those points is, a routine statement. Our 
next proposition gives the associated linear program. 

Proposition 2.9. Let G be any graph, and let lo be given by letting LOi = deg^G). Then G 
corresponds to a cell if and only if the linear program given by maximizing c ■ d on the polytope 
given by 

dj > Mi , j ; ^2 °ij = u i 
has (the indicator function of) G as its optimal vertex. 

Proof. G corresponds to a cell if and only if there is some (^-matching r with support G. By 
Proposition 12.71 this holds if and only if G itself is LP-optimal, which amounts to the statement of 
the theorem. □ 

The final proposition of this section presents a criterion for genericity in terms of LP-optimality. 

Proposition 2.10. A metric d is generic if and only if its supports of LP-optimal fractional lo- 
matchings (for all lo) all have no nontrivial even tours. 

Proof. Proposition 12.61 provides one direction of the proof. For the other direction, suppose d is 
not generic. Take some signed affine dependence C among the vertices of a face, which we can 
write as u := Yliei c i v i = J2jejbjVj with I n J = and = Yl^j = Then c and b are 
both LP-optimal fractional a;-matchings; c + b is also, supp(c) and supp(fr) are disjoint spanning 
subgraphs of i^ S upp(«j) (the complete graph on the coordinates of lo which are nonzero) , so their 
union, which is supp(c+6), must contain a nontrivial even tour (alternately take edges from supp(c) 
and supp(fr) until we complete a tour.) 

□ 

In the next section, we leverage this LP-theory to make statements about the dimension of the 
tight span of a generic metric d. 

3. Dimension of tight spans 

In this section, we prove Theorem 11.21 We start by proving the indicated bounds. 

Theorem 3.1. Let d be a generic metric. Then the tight span of d has dimension at least \^~\ and 
at most |_§J 

Proof. The dimension of the tight span is equal to the maximal codimension of an interior sim- 
plex in the corresponding triangulation of A(n,2). Since this polytope is [^-^- J -neighborly, this 
codimension can be at most (n — 1) — [-^"J = |_§J ■ 

For the lower bound, we use the LP-theory developed in the previous section. Let c be the 
LP-optimal fractional 1-matching, which is unique by Proposition 12.71 We claim that the con- 
nected components of supp(c) are all odd cycles and isolated edges. Suppose that some connected 
component of supp(c) has a leaf i connected to only one vertex i' . Since 1 = ^ c «i = C H' '■> an d 
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1 = ^2jCi'j = 1 + ^2j^iCi'j, i' must only be connected to i, and that connected component is a 
single edge. 

If a connected component has no leaves, it contains a cycle, which must be odd by Proposition ^. 61 
If G contains any other edges, since it has no leaves, it follows that it must have a nontrivial even 
tour (follow edges not in the cycle until you reintersect the cycle, then follow whichever half of the 
cycle gives you an even total length.) So that connected component must be just an odd cycle, 
completing the proof of the claim. 

The edges in supp(c) form a cell. We claim that we can find a subset of size at most which 
spans G. Indeed, it is immediate that taking all isolated edges and [~|] spanning edges from each 
&-cycle does the trick, as each connected component of this subgraph has either one edge and two 
vertices or two edges and three vertices. By Proposition 12.41 this subset corresponds to an interior 
cell, and its codimension is (n — 1) — (|_ttJ — 1) = [§] as desired. □ 

Whether a tight span has the maximum dimension of ^, for n even, corresponds to testing 
integrality of a linear program, namely whether or not the LP-optimal fractional 1-matching is 
integral. If this matching is integral, its support is a set of ^ edges, which corresponds to an 
interior cell of codimension ^. Conversely, an interior face of codimension ^ corresponds to a 
matching, which must be LP-optimal as otherwise its edges do not form a face. 

In particular, the upper bound of Theorem ll.2l is easy to achieve. 

Proposition 3.2. If d is the metric given by dij = 2 for \i — j\ = [§J and dij = 1 + cty otherwise, 
where the ctij 's are positive numbers smaller than 1/n 2 forming a transcendence basis over Q. Then 
d is generic and the tight span of d has dimension [^J . 

Proof. We need to show first that d is generic. By Proposition I2.1U1 it suffices to show that no 
LP-optimal fractional u-matching c has a nontrivial even tour. Suppose one does; then let the odd 
edges of the tour comprise the (multi)set O, and the even edges E. We can assume that OnE = 0, 
since if O n E ^ 0, the tour breaks up into two even subtours. Furthermore, there is some e such 
that e < Cij for all i,j with the edge ij in O or in E. Since reducing each Cjj with ij € O by e and 
increasing each c^ with i,j € E by e yields another fractional u>matching, and so does the reverse 
operation, we must have J2 ije0 dij = ^2 ijeE dij. 

However, by our choice of the df/'s, the only way such sums can be equal is if all relevant d«'s 
are 2, but it is impossible to form a nontrivial even tour with only 2-edges, since they form disjoint 
edges (and possibly one two-edge connected component.) 

If n is even, the LP-optimal 1-matching is given by the edges ij for j = i + ^. This is spanning 
and has support of the appropriate size. If n = 2k + 1 is odd, the LP-optimal 1-matching is given 
by taking ±(1(1 + k) + 1(1 + 2k) + (1 + k)(l + 2k)) along with the edges ij for j = i + k, 2 < i < k. 
This again has codimension |_§J- □ 

Indeed, for n even, any metric with an integral LP-optimal 1-matching will yield a metric of 
dimension [§J- The condition that the a^'s form a transcendence basis is only needed to imply 
that no subset of them has the same sum as any other subset modulo 1, which is needed for 
genericity. 

The lower bound is trickier to achieve. We give a construction which shows it is tight for n = 3k; 
the construction is easily modified to produce examples for n = 3k + 1 and n = 3k + 2. This metric 
has triples of points which are pairwise far apart, while the distances between points in different 
triples are all small. 
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Proposition 3.3. Let n = 3k. Let d be a metric on [n] given as follows 

dij = 2if^l = [|l 

dij = 1 + ctij otherwise, 

where the ctij 's are positive numbers smaller than 1/n 2 forming a transcendence basis over Q. Then 
d is generic, and the tight span of d has dimension % . 

Proof. We need to show first that d is generic. As in Proposition ^. 21 if d were not generic, we could 
find a nontrivial even tour with Y2ijeO = SijeE ^y'i where O is the multiset of odd edges of the 
tour and E is the multiset of even edges of the tour. Again by our choice of the djj's, the only way 
such sums can be equal is if all relevant d^s are 2, but it is impossible to form a nontrivial even 
tour with only 2-edges, since they form disjoint triangles. 

Next, we need to show that no interior face has dimension greater than ^, i.e. that all supports 
of LP-optimal fractional w-matchings, for uj > 0, have at least edges. 

Suppose we have such a support G. Then by Proposition 12.81 (the indicator function of) G is 
itself LP-optimal, and a spanning subgraph since to > 0. We enumerate the edges as follows. Let 
ai be the number of 2-triangles {3r + 1, 3r + 2, 3r + 3} containing exactly i edges from G. Then G 
contains a\ + 2ai + 303 edges from 2-triangles, and n = 3(ao + a\ + + a%). We now enumerate 
edges of G not in 2-triangles. 

Suppose G contains no edges from a 2-triangle {1,2,3}. Then it contains edges lx\,2x2, and 
3x3, but we must have x\ = X2 as otherwise we can replace the edges lxi and 2x2 by 12 and X1X2, 
to get a better matching, contradicting LP-optimality of G. Similarly, we must have x\ = x%. So G 
contains three intertriangle edges incident upon this 2-triangle, giving us a total of 3ao intertriangle 
edges. 

Similarly, if G contains 1 edge from a 2-triangle {1,2,3}, let it be 12. Then G must contain 3x 
for some x, giving us a\ more intertriangle edges. So assuming all 3ao + a\ edges just enumerated 
are distinct, G has at least 3oo + a± edges not in 2-triangles, for a total of 3oo + 2a\ + 2a2 + 3a3 > ^ 
edges. All that remains to be shown is that all enumerated intertriangle edges are distinct. 

Let 14 be an intertriangle edge; we need to show that G either contains two edges from {1, 2, 3} 
or two edges from {4,5,6}. G contains some edge 2x; if x ^ {1,3,4}, then we can replace 14 and 
2x by 12 and Ax in the matching to contradict LP-optimality of G. So G contains an edge from 
{21,23,24}, and similarly an edge from {31,32,34}, {51,54,56}, and {61,64,65}. 

If G contains 23, it can't contain edge 51, or else we can replace {14,23,51} by {12, 13,45} to 
contradict LP-optimality of G. Similarly, if G contains 23, it can't contain edge 56, or else we can 
replace {23,14,56} by |{12, 13, 23, 45, 46, 56} to contradict LP-optimality of G. So if G contains 
23, it must contain 54, and similarly 64, so we are done. 

If G contains 24, then it can't contain 51 (or we can replace {24,51} by {12,45}) or 56 (or else 
we replace {14, 24, 56} by {12, 45, 56}), so it must contain 54, and similarly 64, and so we are done. 

Similarly, if G contains 34, it must contain 54 and 64. The only remaining case is where G 
contains 21 and 31, in which case it contains two edges from triangle 123, completing the final case 
of the proof. □ 

So both bounds in Theorem II. 21 are in fact tight, and in fact we can get any dimension satisfying 
those bounds by titrating the triangle construction of Proposition ^. 31 and the matching construction 
of Proposition 13.21 
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